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Abstract 

Plants and algae possess plastids and mitochondria harboring their own genomes, which are replicated by the apparatus consisting 
of DNA polymerase, DNAprimase, DNAhelicase, DNAtopoisomerase, single-stranded DNA maintenance protein, DNAIigase, and 
primer removal enzyme. In the higher plantArabidopsis thaliana, organellar replication-related enzymes (OREs) are similar in plastids 
and mitochondria because many of them are dually targeted to plastids and mitochondria. In the red algae, there is a report about a 
DNA replicase, plant/protist organellar DNA polymerase, which is localized to both plastids and mitochondria. However, other OREs 
remain unclear in algae. Here, we identified OREs possibly localized to organelles in the unicellular rhodophyte Cyanidioschyzon 
merolae. We then examined intracellular localization of green fluorescent protein-fusion proteins of these enzymes in C. merolae, 
whose cell has a single plastid and a single mitochondrion and is suitable for localization analysis, demonstrating that the plastid and 
the mitochondrion contain markedly different components of replication machinery. Phylogenetic analyses revealed that the 
organelle replication apparatus was composed of enzymes of various different origins, such as proteobacterial, cyanobacterial, and 
eukaryotic, in both red algae and green plants. Especially in the red alga, many enzymes of cyanobacterial origin remained. Finally, 
on the basis of the results of localization and phylogenetic analyses, we propose a model on the succession of OREs in eukaryotes. 

Key words: evolution of replication apparatus, mitochondria and plastids, subcellular localization, GFP-fusion proteins, red 
algae. 



Introduction 

Plants and algae possess plastids and mitochondria. Both or- 
ganelles contain their own genome to perform their metabolic 
functions. The organellar genomes are thought to be repli- 
cated by the machinery including DNA polymerase, DNA pri- 
mase, DNA helicase, DNA topoisomerase, single-stranded 
DNA binding protein (SSB), DNA ligase, and primer removal 
enzyme (Langston et al. 2009). As plant organellar genomes 
do not encode these enzymes, the organellar genomes must 
be replicated by the enzymes that are encoded by the nuclear 
genome and transported to the organelles after their synthesis 
(Wang et al. 1997). 

In Escherichia coli, DNA polymerase III (Pol III) holoenzyme 
consisting of ten subunits functions in DNA elongation in rep- 
lication. The catalytic a subunit (encoded by the dnaE gene) of 
Pol III belongs to Family-C DNA polymerase, and it is conserved 
in all bacteria (Langston et al. 2009; Sanyal and Doig 2012). 



DnaB helicase unwinds double-stranded DNA and then SSB 
maintains the DNA unwound. DNA gyrase, which consists of 
two subunits, gyrase A and B, is a type II DNA topoisomerase, 
and alleviates the strain resulting from DNA unwinding. DnaG 
synthesizes an RNA primer at the origin of replication in the 
leading strand and synthesizes primers in about every 1 kb in 
the lagging strand. Ribonucleotides within the RNA primer are 
removed by nick translation with 5'-3' exonuclease and poly- 
merase activity of DNA polymerase I (Pol I), and the nicked 
DNA is sealed by NAD + -dependent DNA ligase LigA. 

In humans, the mitochondrial genome is replicated by a 
replisome consisting of DNA polymerase y (Poly), human 
mitochondrial RNA polymerase (POLRMT), TWINKLE helicase 
(T7 gp4-like protein with intramitochondrial nucleoid locali- 
zation), topoisomerases 1 and 3a, SSB, ligase 3, and RNase 
H1 (Arnold et al. 2012; Kasiviswanathan et al. 2012). Poly is 
responsible for the replication and repair in animal 
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mitochondrial genomes, and belongs to Family-A DNA poly- 
merase, which shares distant sequence similarity to bacterial 
Pol I. Animal Poly comprises two subunits: a large subunit 
with DNA polymerase and 3'-5' exonuclease activities, and 
a small subunit that enhances processivity and primer recog- 
nition (Kaguni 2004). POLRMT is a homolog of the RNA po- 
lymerase of T3/T7 phage (RPOT), which is a single polypeptide 
enzyme. POLRMT had been thought to function in transcrip- 
tion, but recently this RNA polymerase was demonstrated to 
function also as a primase (Wanrooij et al. 2008). In plants, 
the homolog of POLRMT is called RPOT type, and RPOT was 
localized to plastids and/or mitochondria for transcription 
(Kuhn et al. 2007). TWINKLE is a homolog of the gp4 protein 
of T7 phage, which has helicase and primase activities. 
TWINKLE is widely conserved in eukaryotes including animals, 
bikonts (plants and protists), and amoebozoa (Shutt and Gray 
2006). Animal TWINKLE shows only helicase activity and re- 
tains a nonfunctional domain of primase. Mitochondrial repli- 
some was reconstituted with Poly, TWINKLE, and SSB, and 
the resulting replisome showed rolling-circle replication with 
high processivity (Korhonen et al. 2004). Processivity is de- 
fined as the number of nucleotides added by a DNA polymer- 
ase per one binding to the template DNA, and in general, 
replicative DNA polymerase has a high processivity value, and 
repair polymerase has a low processivity value. 

Recently, in higher plant Arabidopsis thaliana, enzymes re- 
lated to organellar genome replication have been identified. 
The genome of Arabidopsis encodes two genes, each encod- 
ing a DNA polymerase belonging to Family-A DNA polymer- 
ase, which are called Pol l-like or Poly (Christensen et al. 2005; 
Mori et al. 2005; Parent et al. 201 1; Cupp and Nielsen 2013). 
These polymerases function in replication in both plastids and 
mitochondria. Subsequently, we reported that these polymer- 
ases are also conserved in many protists, and the enzymes 
are phylogenetically distinct from bacterial Pol I and Poly. 
Because it was thought that a new name is needed, we pro- 
posed to call them POP (plant/protist organellar DNA 
polymerase, Moriyama et al. 2011; Moriyama and Sato 
201 3). Arabidopsis thaliana has a single gyrase A that is local- 
ized to both plastids and mitochondria, and has two gyrases 
B1 and B2 that are localized to chloroplasts and mitochondria, 
respectively. Inhibitor assay and complement test in E. coli 
revealed that A. thaliana gyrases are involved in organellar 
genome replication (Wall et al. 2004). Arabidopsis thaliana 
has an A-type topoisomerase I (AtTOPI), which is a homolog 
of bacterial topoisomerase I, TopA. Localization analysis using 
green fluorescent protein (GFP) fusion protein showed dual 
localization of AtTOPI to chloroplasts and mitochondria 
(Carrie et al. 2009). In addition to this report, by proteome 
analysis, AtTOPI was detected in the chloroplast nucleoids in 
A. thaliana (Olinares et al. 2010). Arabidopsis thaliana 
TWINKLE is localized to both chloroplasts and mitochondria 
(Carrie et al. 2009), and has both helicase and primase activ- 
ities unlike animal TWINKLE retaining only helicase activity 



(Diray-Arce et al. 2013). Recombinant Arabidopsis SSB 
(AtSSB) binds to single-stranded DNA (ssDNA), but not to 
double-stranded DNA, and AtSSB-GFP protein is localized to 
mitochondria (Edmondson et al. 2005). Organellar SSB (OSB), 
which has a SSB-like domain, is a plant-specific SSB. 
Arabidopsis thaliana has four OSBs: AtOSBI and 2 are local- 
ized to mitochondria, chloroplasts, respectively, and AtOSB3 is 
localized to both chloroplasts and mitochondria (Zaegel et al. 
2006). Replication protein A (RPA), a nucleus-localized SSB in 
eukaryotes, comprises three subunits, RPA70, RPA32, and 
RPA14. Rice has three RPA70s, three RPA32s, and one 
RPA14. These subunits make three types of complexes, 
namely, type A, B, and C complexes with different combina- 
tions of the three types of subunits. Among them, type A RPA 
complex is localized to chloroplasts in rice (Ishibashi et al. 
2006). DNA ligase 1 (LIG1) is localized to both mitochondria 
and nucleus, but not to plastids, and plastidial ligase remains 
unclear (Sunderland et al. 2006). In bacteria, Pol I acts in 
primer removal with its 5'-3' exonuclease activity. The 
Arabidopsis genome encodes two 5'-3' exonuclease genes 
(5'-3'EX01 and 2) having sequence homology to 5'-3' exonu- 
clease domain of bacterial Pol I, and these enzymes were pre- 
dicted to be localized to chloroplasts or mitochondria (Sato 
et al. 2003). 

Previously, we demonstrated that a rhodophyte 
Cyanidioschyzon merolae has two organellar DNA polymer- 
ases, POP and Pol I (Moriyama et al. 2008). By immuno- 
blot analysis using isolated organelles and observation of 
GFP-fusion proteins in onion epidermal cells, we showed 
that CmPOP was localized to both plastids and mitochondria, 
while CmPol I was localized to plastids. We also deter- 
mined enzymatic activity of these polymerases. CmPOP 
showed the high processivity value (> 1,300 nt) and had 
3'-5' exonuclease activity, while CmPol I showed middle 
level processivity (<70nt), and had no 3'-5' exonuclease ac- 
tivity (Moriyama et al. 2008). Other organellar replication- 
related enzymes (OREs) have not been identified in red and 
green algae. 

The unicellular rhodophyte C. merolae has a remarkably 
simple cell structure consisting of a single mitochondrion 
and a single plastid per cell. Its normal habitat is hot springs, 
which is warm (up to 50 °C) and acidic (pH 1.5-2.5) with 
sulfuric acid. The size of its completely sequenced genome is 
16,546,747 bp, with 4,775 predicted protein-coding genes 
(Matsuzaki et al. 2004; Nozaki et al. 2007). The method of 
transformation by polyethylene glycol (PEG) was recently 
established in C. merolae (Ohnuma et al. 2008). Subcellular 
localization of GFP-fusion protein in C. merolae was first dem- 
onstrated by Watanabe et al. (2011). They constructed the 
pCG1 vector, in which the target protein-GFP fusion gene is 
overexpressed by C. merolae apcC promoter. 

In the present study, we identified the components of 
OREs, and analyzed their organellar localization by observation 
of GFP-fusion protein. We also estimated the origin of OREs 
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by phylogenetic analysis. Finally, we discuss a model on the 
succession of organellar replisome in eukaryotes. 

Materials and Methods 

Culture Conditions 

Cells of C. merolae strain 10D (Toda et al. 1998) were inocu- 
lated in the 2 x Allen's medium (Minoda et al. 2004) at pH 
2.5. Flasks were shaken under continuous light provided by 
two fluorescent tubes (30 nmol/m 2 /s) at 40 °C. 

Construction of GFP-Fusion Genes 

Genes of putative OREs were amplified by PCR with specific 
primer sets (supplementary table S1, Supplementary Material 
online). Using In-Fusion HD cloning kit (Clontech Laboratories, 
Mountain View, CA), the amplified DNA fragments were in- 
serted intoXbal-cut pCG1 vector (Watanabe et al. 201 1) that 
contains the apcC gene promoter of C. merolae, sGFP, and 
NOS terminator. 

Transformation in C. merolae Cells 

Cells of C. merolae were transformed by the PEG-method 
according to Ohnuma et al. (2008) except for the preparation 
of recipient cells. Subcultured cells were grown to 
OD 750 = 3-5 with aeration by 1 % C0 2 , and collected by cen- 
trifugation at 4,000 x g for 5min at 40°C. The precipitated 
cells were resuspended in 2 x Allen's medium to OD 750 = 1 1- 
12. The concentrated culture was subjected to darkness for 
12 h with aeration by 1% C0 2 at 40 °C in a special culture 
apparatus that was assembled from materials for gel electro- 
phoresis. Namely, two 0.8-mm silicone tubes (0.8-mm wall, 
Bio-Rad Laboratories, Hercules, CA) were inserted between 
the two glass plates with a silicone spacer (clearance of 
2 mm) for aeration (supplementary fig. S1, Supplementary 
Material online). The cells were grown in this apparatus 
under high light provided by two 20-W krypton bulbs 
(250 nmol/m 2 /s) for 12— 16 h. To 100|il of the culture taken 
from the apparatus was added to 1 50 nl of a DNA mixture 
(1 5 of 1 0 x MA-I [Ohnuma et al. 2008], 1 00 [ig of salmon 
sperm DNA, and 50 \ig of plasmid DNA), and then 250 jxl of 
60% PEG solution in MA-I was added to the cell-DNA mix- 
ture. Three minutes later, the mixture was diluted to 40 ml 
with 2 x Allen's medium, and cultured overnight at 40 °C 
with aeration by 1 % C0 2 . 

Microscopic Examination 

For observation of fluorescence of GFP, transformed cells 
were collected by centrifugation at 200xg for 10min, and 
examined by a fluorescence microscope BX-60 (Olympus, 
Tokyo, Japan), using the cube U-MNIBA (Olympus). When 
GFP fluorescence was faint, we performed immunofluores- 
cence detection using anti-GFP antibody, essentially according 
to Nishida et al. (2004). In the reaction with antibody, an 



anti-GFP monoclonal antibody was diluted to 1/200 with 
immunoreaction enhancer solution (Can Get Signal immunos- 
tain solution B, TOYOBO, Osaka, Japan) and reacted for 1 h. 
As the secondary antibody, anti-mouse monoclonal antibody 
labelled with Alexa fluor 488 (Invitrogen, Carlsbad, CA) was 
diluted to 1/200 with immunoreaction enhancer solution, and 
reacted for 30min. The immunostained cells were observed 
by the fluorescence microscope with the cube U-MNIBA 
(Olympus) for Alexa fluor 488, the cube U-MWIG (Olympus) 
for observation of autofluorescence of chlorophyll, and the 
cube U-MWU (Olympus) for observation of DAPI-stained 
DNA. Nomarski differential interference image was also 
recorded. All microscopic images were captured by a digital 
camera (model DP-70, Olympus). 

Phylogenetic Analysis 

Sequence data of homologous proteins of C. merolae OREs 
were obtained from the Gclust database (Sato 2009, version 
2010-05, data set Gclust2010e29b). The sequence of 17 gp4 
was obtained from Genbank. Amino acid alignments were 
prepared by the MUSCLE software version 3.6 (Edgar 2004). 
Manipulation of sequence data were performed by the SISEQ 
software version 1 .59 (Sato 2000). The alignments were used 
to infer phylogeny by the Maximum Likelihood method using 
the TreeFinder software version March 201 1 (Jobb et al. 2004) 
with the WAG model with "Empirical" option. 

Results 

Replication-Related Genes Whose Products Are Possibly 
Localized to Organelles in C. merolae Genome 

Table 1 is a list of putative components of OREs in C. merolae, 
selected by Gclust database search (Sato 2009) using replica- 
tion enzymes in bacteria and OREs in higher plants and ani- 
mals as references (supplementary table S2, Supplementary 
Material online). Cyanidioschyzon merolae genome encoded 
homologs of most OREs of A. thaliana, except OSB, which 
is conserved only in higher plants (Zaegel et al. 2006). 
Cyanidioschyzon merolae has a putative DnaG primase 
(encoded by the nuclear genome) and DnaB helicase (encoded 
by the plastid genome), which are bacterial enzymes that are 
not encoded by the genomes of higher plants and animals. 
Cyanidioschyzon merolae also has a homolog of bacterial Pol I 
harboring two domains, 5'-3' exonuclease and DNA polymer- 
ase, while green plants have an enzyme containing a single 
5'-3' exonuclease domain. Cyanidioschyzon merolae has two 
eukaryotic topoisomerase II, CMB01 3C and CML330C, which 
we named TOP2a and TOP2b. 

Three computer programs, TargetP (http://www.cbs.dtu. 
dk/services/TargetP/, last accessed January 22, 2014), 
WolfPSORT (http:/AA/ww.genscript.com/psort/wolf_psort. 
html, last accessed January 22, 2014), and Predotar (http:// 
urgi.versailles.inra.fr/predotar/predotar.html, last accessed 
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Table 1 

List of Replication-Related Enzymes Possibly Localized to Plastid or Mitochondrion in C. merolae and Summary of Localization Analysis 



Annotation 



Gene Identifier 



Phylogenetic Origin 



Prediction Results 



GFP 



TargetP 



PSORT 



Predotar 



DNA polymerase 

POP CMO270C 
Priming 

DnaG CMQ286C 

DNA helicase 

DnaB (pt-encoded) CMV098C 

Primase/helicase 

TWINKLE CMT452C 

Topoisomerases 

Gyrase A CMS243C 

Gyrase B CMH166C 

Topoisomerase I (type IA) CMI252C 

Topoisomerase I (type IB) CMM263C 

Topoisomerase II a CMB013C 

Topoisomerase II b CML330C 

Topoisomerase III alpha CMG056C 

Topoisomerase VI A-1 CME071C 

Topoisomerase VI A-2 CML010C 

Topoisomerase VI A-3 CMQ111C 

Topoisomerase VI A-4 CMR274C 

Topoisomerase VI B CMT273C 

ssDNA maintenance 

SSB CMI135C 

RPA 70kDa CMC123C 

RPA 30kDa CMI291C 

Ligation 

DNA ligase CMK235C 
Primer removal 

DNA Pol I CMT462C 

RNase HI large subunit CMK297C 

RNase HII CMT626C 

DNA2 CMK133C 

FEN1 CMG106C 



Unknown 

Cyanobacterial 

Cyanobacterial 

Eukaryotic 

Cyanobacterial 

Cyanobacterial 

Cyanobacterial 

Eukaryotic 

Eukaryotic 

Eukaryotic 



a-Proteobacterial 

Eukaryotic 
a-Proteobacterial 
Bacterial (unknown origin) 



mt 



mt 



mt 

None 

Pt 

Pt 

None 

Pt 
mt 

mt 

None 

mt 

mt 

mt 

None 

mt 

mt 
None 

mt 

mt 

None 

mt 

mt 

None 



Pt 
mt 

Pt 

Pt 
Pt 
Pt 

nuc 
Pt 

cyto 

Pt 

Pt 

Pt 

Pt 

Pt 

nuc 

mt 

Pt 
nuc 

Pt 

Pt 

nuc 
Pt 

pt (nuc) 
Pt 



ER 

None 



mt 

ER 

None 

None 

None 

None 

None 

mt 

None 

None 

None 

None 

None 

mt 

mt 
None 

None 

mt 

None 

mt 

None 

mt 



pt, mt 
Pt 

(pt a ) 
mt 

pt 

Pt 

Pt 

nuc 

cyto b 

mt 



cyto D 



mt 
nuc 
nuc 

pt, mt 

Pt 

mt 
nuc 
nuc 



Note. — Predicted results of localization consistent with the GFP results are marked by bold italic. Abbreviations: pt, plastid; mt, mitochondrion; nuc, nucleus; cyto, cytosol; 
ER, endoplasmic reticulum. 

a No GFP analysis was done because dnaB gene is encoded by the plastid genome. 

b ln this study, N-terminal peptides were fused with GFP for targeting analysis, and cytosol localization was observed in these enzymes. However, these enzymes have a 
nuclear localization signal within the protein sequence after the N-terminus. 



January 22, 2014) were used for the prediction of subcellular 
localization of putative OREs. The results were considerably 
different among the three programs (table 1). We also 
made alignments of putative OREs and determined whether 
putative OREs have an N-terminal extension for organellar 
targeting (supplementary fig. S2-1 to S2-9, Supplementary 
Material online). On the basis of these results, we selected 
proteins for analysis of subcellular targeting. 

Intracellular Localization of Putative OREs in C. merolae 

The C. merolae cell has a simple cell structure with a single 
plastid and a single mitochondrion, and reproduces itself 



by binary fission (fig. 1/4). The N-terminal extension sequence 
of each putative ORE was cloned into pCG1 vector for 
overexpression of its GFP-fusion protein driven by the apcC 
promoter (fig. 16). The amino acid sequences of putative 
OREs fused with GFP are given in supplementary figure S2, 
Supplementary Material online. The plasmids were introduced 
into C. merolae cells by the PEG method, and GFP fluores- 
cence was observed (fig. 1C-f and supplementary fig. S3, 
Supplementary Material online). When GFP fluorescence in 
the nucleus-like structure was faint, the protein targeting 
was confirmed by immunostaining using anti-GFP antibody 
with 4',6-diamidino-2-phenylindole (DAPI) staining (supple- 
mentary fig. S3, Supplementary Material online). 
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Fig. 1. — Localization of ORE-GFP fusion proteins in C. merolae cell. (A) Fluorescence images with DAPI staining of an interphase cell and a dividing cell. 
Nuc; Nucleus, Mt; Mitochondrion, Pt; Plastid. (B) Schematic representation of the constructs expressing GFP fusion proteins under the control of the apcC 
promoter of C. merolae (PapcQ and the A/05 terminator (nos). Inserts corresponding to transit peptide regions (TP) were cloned into the pCGI vector. (C-£) 
Fluorescence microscopic images of the transiently transformed C. merolae cells. Images of OREs localized to plastid (Q, mitochondrion (D), or both plastid 
and mitochondrion (£) are shown. 



The DNA replicase, POP, was dually targeted to plastid and 
mitochondrion (fig. 1£), which is consistent with our previous 
results by immunoblotting (Moriyama et al. 2008). There are 
two methionine residues in the N-terminus region of LIG1 
(supplementary fig. S2-1, Supplementary Material online). 
A construct starting from the first methionine residue in 
LIG1 showed strong GFP-fluorescence in the mitochondrion 
and weak fluorescence in the plastid, whereas another con- 
struct starting from the second methionine residue showed 
GFP-fluorescence in the plastid (fig. 1£). LIG1 might be trans- 
lated from the two different methionine residues. 

TWINKLE, which had helicase/primase activity and was 
localized to both plastids and mitochondria in A. thaliana, 
was observed only in mitochondrion in C. merolae (fig. 1D). 
DnaG primase, which was conserved in red algae and not in 



green plants, was localized to plastid (fig. 10- In C. merolae, 
DnaB helicase is encoded by the plastid genome, and the 
enzyme is supposed to be localized to plastid. Among DNA 
topoisomerases, two subunits (A and B) of gyrase and type-IA 
topoisomerase I (TOP1) were localized to the plastid, 
while topoisomerase ll-b (TOP2b) was localized to the mito- 
chondrion (fig. 1C and D). Other tested topoisomerases, 
namely, TOP1 (type IB), TOP2A, and TOP6, were not observed 
in either plastid or mitochondrion (supplementary fig. S3, 
Supplementary Material online). The C. merolae genome en- 
codes five enzymes related to primer removal, RNase HI and 
HII, DNA2, FEN1, and Pol I that has 5' -3' exonuclease and 
polymerase domains. The GFP-fusion results suggested that 
Pol I and RNase HII are localized to plastid and mitochondrion, 
respectively (fig. 1Cand D). DNA2 and FEN1 were observed 
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in the nucleus in C. merolae (supplementary fig. S3, 
Supplementary Material online). 

The fluorescence of SSB-GFP was observed only in the mi- 
tochondrion (fig. 1D). We also tested a construct starting from 
the second methionine residue in SSB, and the construct 
showed nonorganellar localization (data not shown). In rice, 
an RPA was detected in isolated plastids, by immunoblotting, 
as well as in nuclei (Ishibashi et al. 2006). However, C. merolae 
RPAs have no extension sequence at its N-terminus, and these 
were localized to the nucleus (Supplementary fig. S3, 
Supplementary Material online). No other gene for ssDNA 
binding protein is found in the C. merolae genome. RNA bind- 
ing protein may function also as a ssDNA binding protein. It is 
known that some RNA binding proteins, such as TIA-1 and 
TIAR in humans, bind to ssDNA (Suswam et al. 2005). 

To examine localization of enzymes translated from native 
start codon(s) including non-AUG start codon, we first 
observed using constructs, authentic promoter:full-length 
protein:GFP. However, no fluorescent signal of these con- 
structs was observed even if immunofluorescence using 
anti-GFP antibody was performed. Next, we prepared con- 
structs overexpressed by apcC promoter, namely, apcC pro- 
moter:full-length protein:GFP. Also in these constructs except 
SSB, GFP-fluorescence could not be observed. Overexpressed 
SSB-GFP was observed as a single granule in mitochondrion, 
not in nucleus in C. merolae cell (data not shown). This gran- 
ular structure might occur by artifact with overexpression of 
full-length SSB. Subcellular localization of other full-length 
OREs could not be determined yet, and we do not discard 
the possibility that some of the proteins we found in this study 
that went to only mitochondria, plastids, or the nucleus could 
still be dual-localized. 

Phylogenetic Analysis of OREs 

Excepting some clearly eukaryotic enzymes, such as POP, type 
IB and IIB topoisomerases, and DNA ligase I, we analyzed the 
phylogenetic origin of OREs. With respect to the POP, we 
previously reported that the origin of POP was unclear be- 
cause POP did not originate from Pol I of cyanobacteria nor 
a-proteobacteria (Moriyama et al. 2008). 

Four types of simplified trees of OREs are shown in figure 2, 
and original trees are also shown in supplementary figure S4-1 
to S4-6, Supplementary Material online. DnaG and 
DnaB, which are not conserved in green plants, originated 
from cyanobacteria (fig. 2A and supplementary fig. S4-1, 
Supplementary Material online). Gyrases A and B originated 
from cyanobacteria in red algae and green plants (fig. 26 
and supplementary fig. S4-2, Supplementary Material 
online). Type IA topoisomerase originated from 
cyanobacteria in red algae, but the enzyme was obviously 
related to a-proteobacterial homologs in green plants 
(fig. 2C and supplementary fig. S4-3, Supplementary 
Material online). Red algal and green plant SSB originated 
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Fig. 2. — Simplified phylogenetic tree of OREs originating from bacte- 
ria. In rhodophytes, enzymes of cyanobacterial origin are DnaB, DnaG (A), 
gyrases (6), and TOP1 (Q. Enzymes of a-proteobacterial origin are SSB and 
Pol I (D). Detailed phylogenetic trees are shown in supplementary figure 
S4, Supplementary Material online. 



from the SSB in a-proteobacteria (fig. 2D). Animal SSB is 
also a-proteobacterial origin (supplementary fig. S4-4, 
Supplementary Material online). Because green plants have 
5'-3' exonuclease but not a full-domain of Pol I consisting of 
5'-3' exonuclease, 3'-5' exonuclease, and DNA polymerase 
domain, we examined phylogenetic relationship of the 5'-3' 
exonuclease domain in bacteria and photosynthetic eu- 
karyotes. The results showed that exonucleases in the red 
and green lineages are monophyletic, and their origin is 
a-proteobacterial exonuclease domain (fig. 2D and supple- 
mentary fig. S4-4, Supplementary Material online). Red 
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algae and green plants have bacterial RNase HII, but their 
RNase HII was not related to RNase HII in a-proteobacteria 
nor cyanobacteria. The enzyme might have been acquired 
by lateral gene transfer from bacteria such as Chlamydiae, 
Firmicutes, or Chlorobi (supplementary fig. S4-5, 
Supplementary Material online). We also performed phyloge- 
netic analysis of TWINKLE. The TWINKLE of C. merolae and 
diatoms are closely related to that of animals and ciliates 
rather than to that of plants. It was reported that animal 
TWINKLE does not have primase activity because some key 
amino acid residues for primase activity are not conserved 
in animal TWINKLE. In C. merolae, the key residues are con- 
served, and C. merolae TWINKLE is assumed to possess 
primase activity (Shutt and Gray 2006). 

Discussion 

Components of Replication Enzymes in Organelles 

In the present study, we identified the intracellular localization 
of OREs in C. merolae, namely, six plastid-localized OREs, four 
mitochondrion-localized OREs, and two dually localized OREs. 
In C.merolae, ORE components of plastid differed significantly 
with those of mitochondrion except dually localized ones, POP 
and LIG1 . This is in contrast with the situation in A thaliana, in 
which components of OREs in plastids and mitochondria are 
nearly identical. 

Dual localization of POP was found in both the red alga 
C. merolae (Moriyama et al. 2008) and the green plants, 
A thaliana (Christensen et al. 2005) and tobacco (Ono et al. 
2007). In contrast, localization of other OREs is variable in 
various species of red algae and green plants. POP consists 
of a single polypeptide and showed high processivity, al- 
though POP belongs to Family A-type DNA polymerase that 
has sequence similarity to E. coli Pol I having a middle-level 
processivity (Moriyama et al. 2008). POP also showed a high 
fidelity with its 3'-5' exonuclease activity (Takeuchi et al. 
2007). These properties of POP suggest that POPs are widely 
conserved organellar replicase in eukaryotes including 
plants, amoebozoa, alveolates, heterokonts, and discristates 
(Moriyama and Sato 2013). 

TWINKLE is identified as a DNA helicase in human mito- 
chondria (Spelbrink et al. 2001). In A thaliana, AtTWINKLE 
showed dual activity, primase and helicase, and this TWINKLE 
was thought to be involved in the replication of genomes in 
plastids and mitochondria in green plants (Diray-Arce et al. 
2013). In C. merolae, however, TWINKLE is targeted to only 
mitochondria. Instead, single domain enzymes originating 
from cyanobacteria, DnaB helicase and DnaG primase, 
are localized to the plastid. A rhodophyte Porphyridium 
purpureum, and diatoms Thalassiosira pseudonana and 
Phaeodactylum tricornutum also have a plastid-encoded 
DnaB and a nucleus-encoded DnaG (supplementary 
fig. S4-1, Supplementary Material online). We confirmed 



that Porphyridium DnaG was localized to plastid in trans- 
formed C. merolae cells (data not shown). These results sug- 
gest that rhodophytes and diatoms (NB: the latter is thought 
to originate from the secondary endosymbiosis involving a 
rhodophyte) retain DnaB and DnaG of cyanobacterial origin, 
and these enzymes function in plastids, whereas the TWINKLE 
consisting of helicase and primase domains functions in 
mitochondria. 

Cyanidioschyzon merolae has 12 genes encoding various 
subunits of topoisomerases, among which gyrases and bacte- 
rial TOP1 are localized to plastids, while eukaryotic TOP2b is 
localized to mitochondria. Organellar localization of eukary- 
otic TOP2 has not been reported in plants. In C. merolae, the 
effects of a gyrase-specific inhibitor, nalidixic acid, were re- 
ported by two groups (Itoh et al. 1 997; Kobayashi et al. 2009). 
In these reports, nalidixic acid arrests not only the replication of 
plastid genome but also that of mitochondrial and nuclear 
genomes. Considering the plastid localization of gyrases in 
C. merolae, these results may suggest that the arrest of plastid 
replication influences mitochondrial and nuclear replication 
by an unknown mechanism. In another possible explanation, 
nalidixic acid might inhibit mitochondrial and/or nuclear topo- 
isomerases in C. merolae. 

Evolution of Enzymes Involved in Organellar Genome 
Replication in Photosynthetic Organisms 

Previously, we proposed a model for the succession of orga- 
nellar DNA polymerase in eukaryotes (Moriyama et al. 201 1; 
Moriyama and Sato 2013). In the present report, addition of 
other OREs completes the entire view of the model (fig. 3). 
First, when the ancestor of eukaryotes acquired mitochondria, 
the genome of the protomitochondrion had been replicated 
with the a-proteobacterial replication machinery, but then the 
host cell started to use a distinct set of replicase, helicase, 
and primase. Namely, Pol III was replaced by POP, and DnaB 
and DnaG were replaced by TWINKLE. The origins of POP and 
TWINKLE are still unknown. In animals, POP was replaced by 
Poly whose origin is estimated as a T-odd phage DNA poly- 
merase (Filee et al. 2002). In addition, a recent finding that 
some cyanophages have a homolog of Poly suggests that Poly 
might originate from a cyanophage enzyme (Chan et al. 
2011). Moreover, in animals, TWINKLE lost primase activity 
and animal mitochondria started to use POLRMT (T3/T7 
phage type RNA polymerase) for priming on replication in 
addition to transcription. SSB is a sole ORE of the a-proteo- 
bacterial origin in animals. After the second endosymbiosis 
that engendered the plastids in the common ancestor of 
Viridiplantae and Rhodophyta, cyanobacterial Pol III was 
replaced by POP, originally functioning in mitochondria. 
Plastids of rhodophytes continued to use DnaB and DnaG of 
cyanobacterial origin, while plastids of green plants started 
to use TWINKLE in place of DnaB and DnaG. Pol I of the 
a-proteobacterial origin was retained in photosynthetic 
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Fig. 3. — Schematic illustration of the processes leading to succession of OREs in eukaryotes. Boxes consisting of six small boxes indicate component 
enzymes of organellar replication, and explanatory notes for each location of boxes are shown at the lower left corner of this figure. Organelle names, 
mitochondria (Mt), and plastid (Pt), are indicated in the upper left corner of each box. The enzymes that originated from a-proteobacteria and cyanobacteria 
are shown in orange and blue boxes, respectively. Eukaryotic enzymes, whose origin is unclear, are shown in pink boxes. Poly is conserved only in 
opisthokonts including animals and fungi, and the enzyme is shown in a purple box. Bacterial enzymes derived from unknown origin are shown in gray 
boxes. A land-plant-specific enzyme, OSB, is shown in green boxes. ssDNA binding protein in plastids of rhodophytes was not identified in the present study, 
and the box is indicated by a question mark. 



eukaryotes; however, its 3'-5' exonuclease and DNA polymer- 
ase domains were lost in land plants, and in rhodophytes, 3'-5' 
exonuclease activity was lost. 

Accordingly, in land plants, components of replication en- 
zymes had been exchanged between plastids and mitochon- 
dria, and are now homogeneous in these organelles. In red 
algae, however, components of OREs are essentially different 
between plastid and mitochondrion. Plastid in red algae 
retains many OREs originating from cyanobacteria. Among 



these cyanobacteria-derived OREs, DnaB is encoded by the 
plastid genome, while others are encoded by the nuclear 
genome in red algae. The same is true in diatoms. The mech- 
anism of replication in the plastid genome in the red lineages 
might need to retain plastid-encoded dnaB gene for helicase 
activity or its nucleotide sequence. Cooperative function of 
DnaB and DnaG in replication and protein-protein interaction 
of these enzymes have been reported in £ coli (Tougu and 
Marians 1996), and this relationship may be conserved in red 
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lineages. In the next step of research, functional analysis of 
replication apparatus in each organelle will be important. 
Reconstitution experiments in vitro of the organellar DNA rep- 
lication by a combination of various ORE proteins will provide 
clues for the problem. 

Supplementary Material 

Supplementary tables S1, S2, and figures S1-S4 are available 
at Genome Biology and Evolution online (http:/A/vww.gbe. 
oxfordjournals.org/). 
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